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Antigen-combining sites of antibodies are 
from VL and VH domains. The third hype 

chain is far more variable than the others sit length, sequence and struc- 
ture, and was not included in the canonical-structure description of the 
conformational repertoire of the three hypc-t variable regions ol Vk 
chains and the first two of VH chains. Here we oresent ;ui analysis of 
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of K3 structures that have more than 
and 10-IGly. {1} the conformation of the 
due 101, and (2) the torso does not co 
the regular hydrogen-bonding pattern 
choice of bulged versus non-bulged toiso 
mariiy by the sequence, through the 
between the side-chains of an Arg or Ly 
at position 101. Thus the torso ' region 
repertoire of conformations, 
other attfiger ' 



•f the 



sdnes 



> 92Cyr 



mthmes 
(J^heet hairpin, The 
^formation is dictated pre 
rmation of a salt bodge 
at position 94 and an Asp 
ppears So have <> limited 
ionics! structure model of 



hiding loops. 

The treads or apices of she ieops have a eery wide variety of confor- 
mations. In shorter M3 regions, and in those containing the non- 
bulged torso conformation, the heads follow the roles relating 
Mtquena; to structure in short hairpins. We surveyed the heads eh 
longer H3 regions, hnding that those with bulged torsos present 
many very different coniormaiions of the head. 

We recognize thai H3, unlike tier other five antigen-binding looses, 
has a conformation that • ' ironoiera, an 

we have analysed the interactions of 133 with residues elsewhere in 
the VH ?om in in th( VI hum., i ' /iti food- md i ei 
effects oo the conformation of H3. We tested these results bv 
attempts to predict the conformations of H3 regions in antibody 
structures solved after the results were derived. 
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Introduction 

The antigen-binding sites of antibodies are con- 
structed from six loops between strands of the 
jksheels in tht I i mains oi the light and 

heavy chains (Figure 1} The light-chain loops are 
called Li, L2, 13 and the heavy-chain loops are 
Ml, m, and H3. Five of the six" loops have been 
shown to krv< j small discr ' ' rtoire of aiain 
chain conformation determi i niy iku ke> 
residues in < work, calleci 

canonical structures (Chorion & Les'k, 1987; 
Chothia el «/., 1989; Tramontane « 1990; 
Bri.inger ef 1991; Wo & Cycler, 1993; Martin & 
Thornton, 1996; e\i da;; than, as k, 1997). Other resi- 
dm m lb s k > s < so mt did u 

1 ' i f i tu o 

pattens from which the main chain conformations 
of the loops, can be recognized in the sequences of 
antibody structures in "blind tests" (Chothia el al, 
1986, 1989). The canonical structure model has 
been established for the three hypervartable 
regions of VL chains (LI, 1.2 and L3), and the first 
two of VH chains (HI and H2). 

The third hyper, ariable region of VH chains, or 
H3> includes the region between the conserved 

<wid t t c ru ' 1 1 1 i HI i In n 

vious work we defined the H3 loop as residues 96 
to 101 of the VH domain, as these were the rest- 

t • ogen bonds 

interactions with residues outside positions 96 to 
101, and the cons rved n .it • >92 and CJlylM 
prov-de useful landmarks to identify this region in 
a new sequence, we trot the extended H3 region, 
residues 92 to 104, here. 

The H3 region is much more variable in length 
and sequence than the other antigen-binding loops 
(Kabat cl rJ., 1991, Rock rf «/., 1994) Because" it falls 
in the region of the V-D-j join in the assembly of 
the immunoglobulin heavy chain gene (Figure 1(e))., 
several rm vhar sms u>i net tion o 1 

diversity, including selection of VH, D and I gene 
segments, iind alternative splicing patterns", in 
expressed antibodies, H3 appears ptommently at 
the centre of the binding site, and interacts with 
die VI. domain as vveti as with other oatts of the 
VH domain (Flame 1(a)). 



t Residue numbering is identical to that of Kabat d ol, 
tl'MP >W| I oi - < i h Hid m 

k lie vine, C.'hi & 5 ' - ' ibui n n of 

t tt t * 1 ^ i lied 100! toot 

and ItSOK, respectively. The nomenclature used here is 
consistent with recent lit > letimes we shall 
refe 'top « f 1 i t t , < f residues 
> u d 1 ' , a ' t . < r 5, to md t 
position 4 after Cys92 or 5 before ClytOS. When it is 
' ' i > tij i.nd heavy chains the 
■ M 11 : h „ i n ' c , itheat 
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Given the central position of the H3 region in 
the antigen-binding site, there are significant u n 
t tu t -. v, itt t fliLt h n i> , well as. wnh t iu lumi- 
work, with the light-chain partner, and with 
Uganda, that pofenhalle influence the conformation 
of K3. However, we show here that in cases where 
the antibody changes conformation in response to 
ligatio the structure or the H3 region in th 
bound form is regular. Tins has implications for 
testing predicted models of antigen-binding sites; 
against crystal structures of imhgsteei antibodies. 
Pc-i et a!. (1997) have solved a structure containing 
the VH domain of antibody 81-8 combined with 
two different Vi„ dnrnanis, sheaving two very 
different conformations of H3. This important 
observation implies that general rules governing 
the conformation of Ho must involve interactions 
outside the local region 

nk-verthciess, some of the bio regions are short 
hairpins, and their structures should follow (.he 
rules governing secpience-confosmaticn corre- 
lations in short hairpins < Venkaiachalam, 1968; 
Sibanda & Thornton, 1985; Rose et at., 1985; 
Chothia & beak, 1987; VViimol & Thornton, 1988; 
Sibanda ei a/., 1989). The conformations of short 
hairpins depend primarily on the sequence of the 
residues within the loop, notably on the position of 
a Cly, Asn or Asp, which are more frequently 
observed to have conformations with positive 
values of 4> and i<, or ;s Fro which can more easily 
accommodate a as peptide. Such loops have con- 
formations that are determined locally. However, 
many of the H3 regions are longer than the range 
covered by these rules. 



Previous work on the conformation of H3 

Chothia & Lesk (1987) described determinants of 
M contt n 9i v 1 fl m ih i < , 

rng loops. That paper focused attention on two sets 
of residues: 9<IArg and lGlAsp, which form a sail 
bridge, and residues 100b arid 100c which pack 
against the VL domain; the significance of the 
94Arg-101Asp hydrogen bond was noted, and was 
included in a set of conditions on the sequence for 
predicting that the H3 conformation would 
resemble that of McPC603, which contains the 
bulged torso region. Tins set the stage for sub- 
sequent work, wbaeh develooed as rnore .structures 
became available. 

Novof.ny <?< ni (1990) and Seabe et ci 099*; also 
noh i the'94Arg t g I open bee d Mis e, >/ 
(1992! described two conformations of the proxi- 
mal residues which they called "kinked - ' and 
"extended;" but did not infer the choice of struc- 
ture from the sequence. Rees et al. (1994) and 
Searle el M. (1994; proposed an empirical classifi- 
cation of seven types of H3 conformations, but this 
cannot survive the appearance of the structures 
with the variety of conformations now available 
Martin & Thornton (1996) proposed conformation- 
determining residues for • 12-residue H3 loops 



I i ps in their notation), 
suggesting unspedhttd rotes for residues at pos- 
itions S9 M and 101 but their method does not 
identify the important 1 p interaction. 

>'ork ) ;t s«l> dated m approach and 
results to our own, Shtra (1996) have ana- 

lysed the conformations of 55 H3 regions, They 
distinguished bulged from non-bulged toixo 
regions (which they call kinked and extended),, and 
proposed rotes based on sequence hi j idfat 
which wii! appear. Although they did no! precisely 
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define die extent of the region of conserved struc- 
ture, ihu>e rah n , > alent to ours fhej noted 
the second bulge, appearing in e.g. Sel55-4. They 
< >rrecd> observe the hps of the i oj s 

1 til 
ficafion of the more re , based on 

established roles tor relating hairpin conformation 
and sequence; but f.hey only mention the extent 
and variety of irregular structures. In particular 
■hey do not observe tlx- disUnction between two 
types of non-bulged torso regions (site below.!. 




( f i t General view of h „ >eu i i i site 

f hah TiB3 Hi i t h 1 m Mr 1 ht 

chain hypervariable regions Li, f.2 and L3 in red; HI 
and 112 in orange, Hi "spotlit" in magenta. Antigen in 
yello* tejSe.h i diagra f the H region lltt trat 
ing the residw rienelai t beri'n? This di a 

0 r m shows f « H m c.e 

McPCofB eh , ent hydiogen 

bonding patterns. 'The >csidue numberim- ae.rees with 
that of Rabat *t o! 09911 except for residues 100a. tOtSb 
and 300c which are labelled TO i, HW J, and 190 K. 
respectively in Kabat c; rrr p.S'iht. (c; 113 appears in the 
region of the V-D-J join m the assembly of the heavy 
chain gene. This oicfore shows the VH domain of 
McPCbOa flmcp), with the residues cnmr;pce»,fia< ; tu 
the V segment sn red. She D segment m yellow and the ( 
segn ent r green Ps f i 1 n t > 

coincides almost exactly "with the H3 region as defined 
here. 
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They also examined the differences in confor- 
mation between Hgated and unligated structures. 
We note that where our investigations overlap our 
results are in complete agreement 



Modelling, engineering and database searching 

Appeared in the context of the problem of model- 
ling., engineering, and design of antibody combin- 
ing, sites; in many cases, the goal has not been to 
derive empirical sequence-structure correlations, 
but to model the loops by ab initio or database 
searching met c . co'leri H a! i- 5 ?^ Martin 
et at., 1989, 199V, Rees et at, 1994). The ab miiio 
methods have the power of generality, and have 
often been combined with careful observation of 
the known structures Moreover, rules that can 
produce models for the c • f< m it n of the proxi 
mal i ssidues « f th< 1 < npii 
blt-m of completion of the loop by sb initio 
methods i-rom our point of view., however, the 

t tic' oi'tli i i s n 

hal for H3 conformation, and (b) the sets of 
assumptions that are required for correct predic- 
tion of loop conformation. 

With this motivation we have also carried out 
database searches for antigen ndin 
(Tramontano & Lesk, 1992; Rustici & husk, 1994). 
'We report here database searches for H3 loops in 
other known protein structures, in order to deter- 
mine the extent to which the conformations of H3 
regions are determined by their sequences, or by 
the panoply of interactions. We searched a data 
bank of well-determined protein structures for 
regions that matched, in conformation, the frame- 
work residues tanking h< i In these cat 

cuh lions we used the e i le the loop to 

identify stretches of polypeptide chain in welt- 
refined protein structures that bridge the selected 
endpoints, and checked for the presence in she 
sequence of special residues that can take on unu- 
sual main-chain conformations and thereby often 
deterroixu; loop conformation. The pattern of 
appearance of these resi iu« in t e mence oi the 
loop is necessary to predict that a loop identified 
from dafeh 5 < 1 eve ine saint confor- 

mation as the target toop. 

Because no n! rmati >o jbout the interaction of 
the loops with e ii t lin - used, success- 

1 f t t c J i H 

iiiation is determined bv residues in the region 
>tse.s and su^ectne ot the - .-k.-c of nde& that 
relate sequence and structure for these cases, even 
it these rules cannot yet be succinctly enunciated. 
(As an alt.,) <• t -rung, other 

investigators have generated conformations that 
bridge fixed given endpoints ab iniiic, without 
reference to known protein structures: see, e.g., 
tk-uccoleri el ai (1988}). 



H3 Region of immunag 



Summary of results; 

Here we present detailed descriptions of the 
tructures of I > • n in tibc d i true- 
Sores and ana i - t mutants of lei or 
formations. Certain features of H'J conformation 
have become clear; in general, the closer to the 

frame work, :h, s . <■■ ;■ > - re . '."-a- | 

residues of Shi region (set- Figur« lib)) the first 
ma jot choice is at residue 101, which can either 
form a (3-bulge ot continue the regular pattern of 
5-sheet hydrogen bondmg; this distinguishes 
bti d froi 1 non bulgi r j 

cases a second bul s *"is formed by residue G-5. 
Residue 100b, the side-chain of 'which usually 
packs against the V, domain, is the site of con- 
siderable action: If it is a Gly, as in KOL {2fb4>, the 
packing against V L is affected. In some cases, the 
side-chain of residue 109b points into its own 
domain instead of across the mterdomam interface, 
affecting the size and shape of the antigen-binding 
pocket. 
Our main results are: 

(1) Clarification of the distinction between the 
torso region {which appears to have a limited 
repertoire of conformations as in the canonical 
structure mode) of other antigen-binding loops) 
and Hi n a t p it <r In i i N in i i ( 
sequence-based rules for deciding which confor- 
mation is adopted (similar to those of Shirai el si. 
i I 

0 \n il) i ht rn i int I t 

Hi and other VH residues, regions of the VI. 
domain, and ligands, and statement of the possible 
varieties of H3 conformation that can arise. 

0} A survey of the conformations of the heads 
or apices of the loops in well-determined Struc- 
tures, in relation to the structure* of the torso. 

(4) Database searches tor loops, in order to clari- 
fy the rules of local interactions and to provide a 
-I. region 

(5.1 Tests of the conclusions derived by evaluat- 
ing thorn efficacy m predicting the conformations of 
H3 loop:; in antibody structures solved after the 
results were derived. 

These conclusions, based on the many new high 
resolution structures now available, extend the 
nsufts of our earlier paper (Chothia & Lesk, 1987). 
A preliminary description of some of our results 
has appeared in Mores el ■"'<■ (.1997). 



Coordinates and Calculations 

We assembled a non-redundant set of antibody 

i >. , i i 1 ul lu 

only structures of resolution £2.8 A which are 
complete in the region of interest, i.e. no missing 
residues, and he dep I iruclu is not broiled' 
to C* atoms {see Padlan, 1996). In some cases two 
or more independent structures am available, from 
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lolecules in the same unit cell, or from different 
rructure determinations of 'he same protein. If the 
* (rrns) deviation of the backbone 



!'N, Cfo C, O) atoms of two o 
<. 1 .0 A, we retained only o 
tare determined of the high 



The m ) 



, of t 



mi 22 



such regions 
chosen from a stru 

eeted sir 

residues (Table 1/ 
i used had lengths 



had 



lengths betwi 

Thirty-two of toe it structur 

using insightll iosym < up.} and b> programs 
written by one of us (be.sk, logo, and references 
contained therein). 



Classes of H3 Conformations 



Analysts of common stsuc total patterns and the 
interactions responsible for them suggests consid- 
ering (he region proximal to the framework separ- 
ately from the tip of the loop. These regions can be 
thought of as the torso and head of the structure. 
Many regions of chin. : i ! 
torso structure although they differ at their hps. 

H3 conformations can be divided into three 
classes (cf. Ohothia & Usk, 1987): 

(X) Most H3 regions have a toaso with a p-imfge 
at residue 101; this torso conformation can be 



a febtoge, hot a regulm fPsbwrt hairpin structure. 

;3} very short lit regions lom a separate 

class, of short hairpins. 

To make precise the distinction between torso 
and head, and to identify appropriate classes of 1-53 
conform;) lions, ore superp, sed t'h; ! ackbone atoms 
of different selected subsets of the H3 region, and 
used the matrices of pairwise mis deviations as the 
distance measure in a clustering algorithm (a 
modified en - < up method). Using 

four residues from the N terminus of the region 
and six from the C terminus, we obtained a single 
major cluster of structures with a common confor- 
mational pattern, for which all pair wise rms devi- 
ations for main-chain atoms are < 12 A (Table 2). 
We define these ten residues as the torso oi the 
region. The main cluster of torso structures has a 
p-bylge at residue 101; the other structures have a 
regular ft- sheet structure. We will refer to these as 
the "bulged"' and "non-bulged" torso confor- 
mations. 



The Bulged Torso Conformation 

In this sevt.cn vvt d [uence moiifs 

associated with the bulged torso conformation, 
desalt 5 the tr r I st e it and * n, 

i . cause vari- 

atio.ii in the structure. Identification of the 



interactions th 1 stabilize « [formation 

reveals rules relating torso cor; formation to 
sequence Figure 2(a) shows the bulge and two 
interactions that commonly stabilise it. a salt 
bridge between Argpl and AspKH, and a hydro- 
gen bond between the side-chain of Trp'103 and 
the carixmvl oxvRen of the residue at the third pos- 



tihdut 



!00c - 



-1). 



strongly but not absol- 
utely conserved, and when they are present « 
bulged conformation is observed..' (Antibody 17E8 
(leap), solved after this work was carried out, con- 
tains an unusual residue a position 93, K. The 
sequence of its H3 region is CKRSYYGSSYV- 
DYWC. in this slmciuw* a salt bridge is formed 
between 93f,ys and lOlAsp. which permits the 
torso to adopt a non-bulged conformation; this 
exception is discussed below.) I.ys94 replaces 
\ig*4 !n p, i U 1 Lr .1 HI mo in, ,fo 

ing Arg9-'l but containing AspiOl, eg. JBU03 
(Imrf), ' show a non-bulged torso structure 
(Figure 2(b)t. The Asp, deprived of its preferred 
nar'iner, forms a hydrogen bond to the side chain 
of TrpHto, precluding'' formation of the bulged 

lb the I ■ ? stabilised ad 

bv ! > he residue in 

position G-4, as in fS (2ft 1 n ore 2)(c). 

of the heavy-chain gene, pos- 
ition 94 arises from foe V segment and 101 from 
the ) segment Of the 51 human VM germ Sine 
genes (Cook & Tombnson, 1995), all but five con- 
tain R or K as position 9-1. Of the six human JH 
germhne genes H onegawa, 1983). all but one con- 



interactions within H3 

Shivai et al. (1996) and Moroa el a!. (1997) have 
stated rules for predicting torsi ! conf-stsnation from 
the sequence of the H3 region. When Asp 10; is 
pn ent without a post I > ■■ side ham a! 

position 94 to form a salt bridge, the regions have 
a non-bulged conformation: in other eases, they 
have a bulged structure: 








bulged 










Of iBtigCi > 10 



During the course of this work an exception to 
these rules appeared: I7B8 (leap), in which an 
alternative salt-bridge, between the side-chains of 
93Lys and 101 Asp, is formed. Shirai el al (1996) 
extended the rules fo include this case, stating that 
if AspiOl is present and Use residues at positions 



274 



113 Region oi , i , , v . > 











19 


JO 




li^b 
1 fli- 




20 






nt r 




24 
IS 




NCo.S 


2egr 


2,2 


21 


13 


D)3 


ivfo 


1,8 


3ft 








2.45 


22 


l* 


ktg ' 


ibbd 


2.8 






gags 


liM 


2.4 


21 








1.9 


W 


14 










H 


■-r;r.-. t 


Irrdb 


XI 
















R6.S 


limf 


2.8 










2 75 


\7 


15 


2*- 30 




2,5 


18 












13 


BVM-01 








15 










i5 


B1332 








16 


4DS 








lt> 


405 








U 


4D5 


< ' 


23 




lb 












NC41 


liica 




. w 






1% 


2j8 






37/S 








i? 


US!. 


KfA, f ;G 


X3 






H'i. 


• ; 










If 








R5 9.v 






19 




KOI.. 










MS 


j<jfe 








lAl^GA! 






16 


"tow et «!. (imr, 


^Alzari cf el, {IW 






{1993}; "GuJdat et a! 


. (1994); : 8hs( ft s 






&bV.»tpfc C>»2); G-M; 


















AAA!) 




1W21; '-*Rw rt ,tf 






(1992S 


; **<atojiMrdr i 


si. (1980); '*Ha 


ct ql \ywry. «aj 



CARD AGAYWG 

CVQE — --C.Y-j.VWG 

CVQE GYIVWG 

CTGSVY --GM0YWG 

CAN LEG YPDYWC 

Ci.HGNY -D"DGJvC 

, YCOVWC 

CTRGYS SMDVTWG 

CAR? - t, , , 

CTRAPY Oi=AfiywG 

CilGVVC YVDM DV WG 

CT3< DDK V GAKOYWG 

CARIA1Y YGYtiAVWG 

CA1A;LI- - - i.G.-. 

CTPCCH- ■- - -GYYGDYWG 

CAHVMG ECATDVAG 

CAPGGWG- AGP. GAY AG 

CAKbGEA- PYAAAYWG 

CAGSSGN-- 
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' ( 
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' ] WG L TB ii 

( t-uilNi ff ,t.'. |K4 1 , „tl A. (GAG); j i 'urn 
'92;; GFuiaiTA A A (!>>,< / < n986): GAui 

F.H. of UiWj ' i -v ;A Gr n ,t . 

w ff A, G28A;, JT„t,p A A, GAAP. 'Oim-JuSi f » . ; . 
!f. am); ^Siawajd ei a. (W4); •'Wnnbc rt .if. 



17ES k-ap 3.5 I" ' ^ . - j t. Ifregi 

' ' . i>i; I ■ . i - - - _ ' t. T33 

40-30 lib? 27 21 CA.Af i:G AS Y i'r>Y AViVfWG I TB 

S.5MA16 limKru! 2.6 20 CAAG1G2AYA Y'AYVLDYWG TB 

NCIO iiimb 2.5 Si CARSGGSYR YDGGFDYWG L T8 

QPG2 Jopg 2.0 Ti5 



G i< f n ' f i \ s , i i 1 , f J > , f g * 

vMalb-y tf A. fAGG), "J^a^ca^n A G^G) 

A it i< Mif ^ c> by ui t n;2 refeis f >n i! ^ n ! o II i i I!\ < u >] u i i k i . rt „ t , I 

Kvcml refers t ! 1 f t \ t - - « 5 U f 

forso conform itioi i - m m ill i«c textj 



labk 2 ' a ms ol four Is, terminal and six C-terminal rescues from SXi rceten;.. 

92CXXX...XXXXXG1W 



Jfvdml lfvdm2 



Mini Iggi! )mtl Imam 2i.it Uir 



Hvd.ni iivom2 



imcp 7fab 

2egr ldfc* 

ifijjjuni lijiiml 

lib! 2fcj 




tf&im'l 
Ihitmi 



2.53 1,43 1.-12 0.00 

5,05 5.34 3.43 1 52 1.13 0,(10 



93 and 94 ate basic, the structure is non-bulged. 
Although only one example is known so for, this 
rule is liked v io be (rue, because the residue at pos- 
ition 93„ is generally buried, and would be unli- 
ke!) to cot i v i charge 

Other than 17E8 (leap)., among the 3? struc- 
tures v stt died there 
are on!\ thoc utr except 0 (ligi tig] 
molecule 1, and ligi molecule 2.1, Yst9.1 [Imaml, 
and one of the structures of 17/9 (lhiJ). The first 
two were also noted by Shim cf */. (1996). Anti- 
body 20-10 has a bulged torso region although it 
contains AsplOl and a (lie at position 9-1. In mol- 
ecule 1 of 26-1.0, coordinate set ligj, the aspartate 
side-chain forms a hydrogen bond to the OH ot 
Tyr27 of HI, which, in "the absence of Arg94 
replaces the expected interaction of AsplOl with 
TrpIOo. In the HI of 26-10, fhe29, generally 
pointing into the doma > . 1 ! "til, points out 
into the solvent; and indeed the conforms don of 
HI of this antibody is peculiar and cannot be 
related to the - tun £ uij thei known HI 
loop. In this structure the Mi" region is involved 
in crystal p<" 1 n contact 1 ch t ay be distort- 
ing the molecule from its solution structure. 

Yst9.1 (.Imam) and 17/9 (Ihi!) contain Arg94 
and lack AsplOl; however each has an unusual 
structure, the torso regions of which are related to 
neither the bulged conformation not the non- 
bulged one. (The overall conformations of the H3 



regions are unrelated to each other; they are a 
different length.) The presence in Ysl9.1 (Imam} o 
a Fro in position 100b (G--b>, and many contact 
with a second antibody molecule in the crysta 



in re of Ho depends 
Arg94 present we e:< 
mation, and the bulj 
when antigen is bo: 
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sible 
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the torso conformation is irregular. In I 
state., the Rd region occupies the region of space in 
which the antigen bind;:, and iiVconforrnalionai 
change moves it out of the way upon antigen bind- 
ing (Figure 3). The occurrence of a regular confor- 
mation in the presence of antigen but not in its 
absence is also seen in antibody 50-1. Antibodies 
differ in the magnitude of their conformational 
changes in reap , i f m o ligation 

The importance of Arg94>, in determining the 
conformation < j > « th properri < 

of mutants. Panka el ai (1988) and Tempest c.t «/. 
1 1} proposed t > ta 

reshaped antibodies arises from a mutation of 
\rg94 i lit ' 1 Oil t i (Set -+A g) it 
deletion (Arg---*Ser) of a salt-bridge with AsplOl 
in both eases, the conversion ot tins residue back 
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Trp 5 00 x y Trp 103 




Figure 2. (a) Tiw typical bulged 
torso conformation, fro::, antibody 
TE33 (Iter), stabilized by th« salt 
bridge between Arg9$ end AspJOl, 
and She hydrogen" bond between 
She side-chain of TrpKB and the 
Garixwyi oxygon of PhiVlOO K, throe 
tosidoes before it. (») Non-bulgec; 
torso region (ire./l). Note the tmw- 
ac&ro between D10) and TrpJOS. 
ie) Stabilization of the belged con- 
formation in }SS9 ObbjY which con- 
wins Arg94 bvt not AsplOl. The 
TrpllB hydrogen bond appears, sod 
Arg9'l forms hydrogen bonds to the 
t . t 



to its original ; 
affinity. 



> acid significantly restores 



i H3 and the H1 loop 

> of HI are aromatic res 



hy 



Who: 



opho 



1-13 



than id e- • ' b o m (i op 
an Arg is present in position 94 and the long 
ivdrophobic { I i ion of its side-chain 

packs against the aroma tics at positions 27 and 



32, in 

mattes of HI pack with an aroma; tic residue in 
position 96 or 99 of H3 'This alternative packing 
produces two different con forma dons of these- 
loops, discussed below, 

interactions between H3 and She :igM chain 

Residues from the C-temsmal part of the H3 
region interact wslh the light chain (Chothia c( al, 
19? * otntti» \ iff the t: t f 
eat heavy and light chains can dimerize promis- 
cuously, some of these interactions recur in many 
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.structures. 'The VL-VH interface involves a sheet- 
fo-sheet packing indudhig residues from both the 
framework (about 3/4 of the interface} and the 
hypervariable regions (about 1/4 of the interface). 

''Hie structure of TE33 t man 

ber of the common interactions between H3 and 
the light chain (Figure 4(a)). Residues near the C 
terminus of the H3 region of T833 (Itet) pack 
I - ijac< nt u 1 1 »d 13 of i light 
chain. lDObfjY packs against 49 t .Y of the light 
ham ,1,1%,,..-, 1 to 

" Y and 99 L P, just C-tert 



na! to 



wain chain N of residue lt!0c„ (G -4) o 
a hydrogw bond to thy OH of the s 
:>i residue 3th. Y, in the framework sir 
ilia! to 1,1, and iust C-ferorina! to I be ■ 
Tro, Residue 36, is Y in 



Stu- 
died except NO10/12.5 and 17-Ia (Ifor) in which it 
is ¥ and CHA255 (find), Sel.55-4 (imfo) and If CI 9 
(igig) in which it is V (it is also V in HyHEt.-lO 
dbs >}). Two 1 k tive h drogen 

bonds: in UC19 (Igig) the side-chain of residue 



The 36 L Y OH-N fOOi:,., !i ire ;n bond appears 
in a!! eases of H3 regions with bulged torso confor- 
mations parted with light chains in which residue 
36 t is a Y, except for frV'04-0.1 (Inbv and icbv), in 
BV04-01 {Inbv and lebv) tire carbonyl of residue 
K.H}b; s points opposite to its usuai direction 
because of 3 change in peptide orientation, and 
alternative hydrogen bonds from .'S6 t Y Old to the 
carbonyl O of 100b,., (Inbv) or to the- carbonyi O of 
HKkv,('k-bs1 lins i , , b\ u d du e 
in orientation of the side-ebonrt oi residue 100b, v 
which is discussed below 



Structural Effects of Sequence 
Variation in VL-VH Contact Residues 

Some of the contacts are conserved and we cm 
not therefore deride whether they tire essentia! f; 
the structure. In some crises the •contact partes 



Table 3. rms deviations of backbone 
gated states of the same antibody 
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changes but the 113 conformation does not, 
-m^gMing th.st thus; mheiKcious arvnmi cssi.-niia! 
In "the case of the interactions of residue 100b 
changes in interactions are correlated with changes 
i.n H i conformation. 



Effects of sequence variation in VL-VH contact 
residues: residues on the VL side 

Some of the VL-VH contact ppeot in many 
antibody structures, and the residues that form 
thorn fend to be conserved, In the case of the 
4V10Gb„ contact: residue 49, is Y in all Fab 

I foi the and 

cyttepomi amibody R45-4M1 flikf), m which it 
is f-\ HyHEL-10 (3hfm) m which it is K, and 
Cha255 (lindX HC19 (Itjit'.J and Se!95-4 (Imfb) in 
.•hi, h >i is G Of these HyHet-10 ( fm) and 
Sel 5 i (lib) i it pc tio 100b,., the 

H3 region of Ch.v255 (hud) is too short to permit 
a norma! 49. -100b, ; packing, and RC19 has a 
normal 300b„ conformation. In the case of the 
contacts with residues 55,. and 6% residue 55,, is 
P in approximately a third of the known struc- 
tures. Residue B9 t is often but not always hydro- 
phobic but is deleted in certain antibodies. No 
correlation between the conformation of H3 and 
the residues at positions 55, and 89, :s observed 
Residue 99. js F in all known ?ab structures; and 



Effects of sequence variation in VX-VH contact 
residues; residues on the VH side 

An important factor in determining the confor- 
mation of H3 and {he shape of the antigen-binding 
pocket is the conformation of residue lObb,. (G-5). 
the i ;khu ju t N-lernHn 1 tt th« XI WG t of.ii 
at the C terminus of the H3 region. In TE33 (t tot), 
residue 100b H is a Y, the side-chain of which 
points to the right {in Figure 4(a)) and packs 
against 49, Y of" the light chain, adjacent to die 1.2 
region, isi certain other structures, the side-chain of 
this residue points in the opposite direction and 
sacks against 13: for instance OVOi-fJi ilcbv, inbv) 
(Figure 4(b)), which contain:; the light chain residue 
4s\Y but in which the side-chain of the heavy 
chain residue 100b„W packs to the left (in 
Figure 4(b)) against the 13 region, and residue 
liJ0 H T packs agau f -1 < id • ucturai features 
are reminiscent of the skle-ebam of F'he B25 of 
insulin, which in molecule 1 packs into its own 
oxonomer but in riolecui • out across the 

diroer boundary, in both 2Zn and 4Z» forms 
(Chothia et id., 1 983). 

Residue 100b w is often a large hydrophobic or 
Y. In KOL (2fb4) it is G, but in this case the bulge 
is regular with residue Fhe lOOa^ instead of 100b«., 

r t , - vu n t 1 i '1 Hb , > t< > <«-> fru 
',rit t i 'e- is also observed in C3 foi 26/9 
il k 1, J ,lhl I \ i ,U if o J 



{Irntb) residue G-4 is a glycine, and in this case a 
second bulge is formed at residue G-5. 

The position of the side-chain of residue 100b H 
fixes tl t 

and constrains the direction the chain takes in 
forming the apex of (he 113 region. Thus fhe con- 
formation of residue 100b,., afreets the sice and 
shape of the antigen-binding pocket. in some anti- 
bodies, part of an anhgen packs between 1.3 and 
H3, U this region is closed off, by the packing of 
100a M or ltjOb H igainst 13, t tads t lie 

on an almost-flat surface above this region (see, 
e.g 8V04 01 (Icbv) in , , 

the conformation of residues 100a H and 100b H is 
svet s to la ivt tin sp ice i u in in tig i 
bury itself quite deeply, as in 813)2. (iigfml) 
(Figure 4(c)). 

The cot lorn i esidue 1 t , is not in gen- 
eral art effect of ligation: in two available examples, 
the conformation of 100b„ does not change with 
state of ligation. For SV04-01 (lebv and Inbv) and 
DBS (Jdba and Idbb; structures of free and iigated 
forms are known In U i lit ictures fret 
and Iigated, the side-chain of residue 100b M W 
p Mnts fcn <' fret aru 

Kgated, the side-chain of residue 100b„Y points 
towards L2 Although the conformation of li3 and 
the VL-VH packing in BV04-01 do change with 
state of it t i these changes do it affect the 
conformation of residue t00b,.. 

For the side-chain of residue 100b,., to point 
towards 1,3, It mttsi occupy a region of space' near 
(and often crowded by) 13 or by a ligand. fo con- 
trast, for this side-chum to point" towards L2, there 
is generally room for it in the region near residue 
49," (usually a Y). Several possRde effects may deny 
access to the region near L3 to residue iO0b M . fn 
T1733 (Hat) and Sel 56-4 (Imfi-.h 8 residue a; the 
C terminus of L3 (104. r in TE33 and 101, W in 
Se155-4) occupies the space that residue "l00b„ 
•would need if it were to point towards L3. In DB3 
fldfa} and Mcl-'CtUW (Iincpl the residue from the 
N terminus of U {W,S in OB3 and 97 ; D in 
McPC603) occupies this space D1.3 (tvfa) and 39 I 
(lacy) ere ot.her examples, in Re. 5 (trmf) residue 
lfiOb H is "ti it er) an it erica! indei 
despite the occupancy of the space adiacent to 113 
by residues from 1.3. 

It appears from these and other examples that, 
when residue l00b w is large and there is run 
foi residue 101 H to p« ft wards 1 * t kn * si 
when thsre is no room for it to do so it (necess- 
arily) points towards 12. That is, we have seen 
no case in which there is room for a large resi- 
due 100b M to point towards L3 but it does not 
do so. 



Non-Bulged Torso Regions 

We distinguish two structures of non-bulged 
t > > i i 2d ( i -) , t . s - a ; 

« ucn m f yi-1! L 2htl) and JEU03 i 



have (he ot 1 ictures an aro- 

matic residue from 'f 13 packs against residues 17 
and/or 32 of HI. In the first class of structures 
(4-4-20 and 6 t i J sidoe is in position 
96, whereas in the second (FEL1Q3 and HyHEL-5) it 
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regj. n * t < . iu es Pigoi " ;h> vs 

the H"3 regions from JEL103 (Imrf) and 4-4-20 
(Iflr), superposed on the residues nearest the 
frai lework. The H3 rcj i F ibbdt ts simik) 
to that of 4-4-20 (if!r): 4-4-20 has length 12 and 8FS 
has length 14, but the rros deviation of the back- 
bone atoms from all but the central two residues of 
4 i 20 from the < >rn pond t rr.s in 81-5 is 
0.9 A. 



Conformation of the Head of the H3 
Loop in Relation to Torso Conformation 



H3 regions with bulged torsos 



IP-Residua H3 regions 

There mo five 1 1 n* kh e 
according to the rrns ci < > >' i I 

>ti «s i o font lasses (Tabit 1 ' I 

NC (2cgr) have bu t 
overall co form > gure 6 sh s the superpo- 
sition of the B3 regions of TR33 (t let) and NC6.8 



Tabte 4. cms da 
N-texmina! and ; 
regions with non-b 



iS'ftetfdue regions 

There arc two 13-residue H3 region: in 03.3 
t'iviV) and YsiTl (Imam). The H3 legion of 131.3 
(Ivfo) has a regular bulge and a four-residue turn 
at the apex that has the sequence and one of the 
structures expected for the sequence XXXX 
(Chothia & Lesk, 198?}. In specifying sequence 
motifs we use X to represent a 'Wild card" - a resi- 
due that can be any amino acid except proline. 
Figure 7(a) and (b; shows the structure of the H.3 
region of 1vfa, and the structure and main-chain 
conformational angles of the apex. (As in 
Al-Laxskani <:t ,v. (199?;, we illustrate the structures 
determined at a resolution of 2.3 A or better.) 

The H3 region of YsW.l (Imam) is very different 
from that of D13 {1vfa}, with a bifurcated apex, its 
seq nc< nt 11 two pi >line re (dues and one 
glycine. Figure 7(c) shows the structure. 



The six structures with 14-residue 133 regions 
include five with bulged torso regions and one. 




Table 5. rms deviations of backbone atoms of 12-residue 
f ].'i regions 
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gP5 (Ibbd), without the bulge-. The five bulged 
regions have similar torso conformations (the nns 
deviation of the backbone a loots, is «0.78 A), 
except (or • >1f i •.;•:!:. 
G-4 (Table 6 A). 

iwi tnusua t n in ^l' a d me in 

VI... may be responsible for the different confor- 
mation of Sel 5^-4 f irafb). Hesidue Li -4 is a >■>•- 
doe in Skri5C>-4 (lmfb), the only occurrence of 
lb,. ,. u. a' tl ■ ->, turn u, an immunoglobu- 
lin of known structure. In all of the other bulged 
bops, the side-chain of the G-4 residue packs 
with the conserved TrplOS; in Jrofb, the side- 
chain of Tyr in position G--5 replaces this contact 
Pigm ? 

In most light chains, the residue C-termsnal to 
the conserved Trp35 L is a Tyr, the hydroxyl of 
which often forms a hydrogen bond to the main- 
chain N of residue '00c :'• i i; (sac abovei. 
Among the 14-residue 113 regions with bulged 
tor o this hvdiogcn burn. 1 is < 1 j ' 

(2fbi), rm.VS (iihf).. NT-WM <7fab) and Jet 42 
(ijei). In contrast, in SeioS-l (Imfo) the residue at 



position 36t is V; the main-chain N of residue 
100c forms an alternative hydrogen bond to the 
side-chain of light chain residue 3\Asn. which 
also forms a hydrogen bond to the mam chain of 
revalue 100s H Oy (see Figure 8): 

34 L Asn ND2 ■ ■ • N l00c M Tyr 
34 L Asn ODI - O IOOa«Gly 

Note that in J539 (?.fbj) but not in all others there 
is an alternate hydros nd ween residues 
X00a H and 34 L (see Figure S), 

The apices of these regions differ in confor- 
mation ('Tabic 6B). 'the apex of je!42. fljei) differs 
from that of lmfb only by a change in orientation 
of one peptide. 

15- Residue regions 

There are seven structures with 15-readue H3 

tor 'o re . ) i ' - ! i.lU n 'tin 
1 0 A' et it L i , u the kick 

ta irons of inch iuai pept , ex) except 813T2 
(ligfm'i; Table 73), Figure 9 shows the structure 
end fit i t v.t ,j tl n, - it ^.oo it m'^'ieo 
BV'04-Oi (Jnbv), the exemplar determined at the 
highest resolution (2 0 A). 

16- Residue i&gions 

All the regions of ibis length in known sin-ctores 
have the bulged conformation of their torso regions, 
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and the paixwise rms deviations of the backbone 
atoms are sS.0.9 A except for one of the structures 
of 17/9 (Ihti, rooiecule- 1), because of the different 
conformation of residue G--.3 (Ab 101b).. which is 




Figure S(a-t)> (ii 



bulged i.n fhe other regions. (Table 8A and 
Figure 10(a) to (c>). 

Ihe apices of these bops have different confor- 
mations. ("Fable 8b and Figure 'iO(d-f)}. 

17-Rmickte regions 

There are four 17-resiuue H3 regions of known 
fracture (Figure "11(a) k> (d) } The conformation of 
the bulged torso of ai! these regions is similar; the 
maximum rms deviation of the t .da ■ i n 
0,69 A (Table 9A). However, the aoices are differ- 
ent (Table 9!3 end Figure life) to thj.) 

Regions longer Shan i 7 residues 

The 2ft- and 22-residue H3 regions have the 
same con forma ft on of the bulged torso (nm devi- 
ation of backbone atoms <05 A), 

Heads associated with non-bulged 
torso regions 

Fate 4-4-20 (Iffr), JEL103 (Imrf), HyHKL-S 
(2hfl) have 12-residue H3 region:; and den of 
erg flbbd) has 14 residues. Their torsos form 
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regular ;mnp»ro!tei strands, without a busge. The 
four residues thar form (he apices of 32-residue 
H3 regions follow the sequence-conformation 
correlations expected for four-residue regions 
{Chotrua & Le.sk, 1987). For instance, the apex 
of 4-4-20 (lflr) is a regular hairpin with 
sequence XXGX, and the angles expected tor 
this sequence; it is a type ¥ turn {Figure 12). 
Similarly, the apex of JEL103 (Irorf) is regular 
hairpin with sequence XGXX and the angles 
expected tor this sequence 

HyHEL-5 (2hfl) also has a four-residue loop at 
the apex, hot doe<-. not assume the expected confer- 




Figure 9. The apa.< of the tS-rtsidue 113 roejoa of the 
undated form of BVtM-01 (mbv). 



matian with standard angles; unusual hydrogen 
bonds from the side-chain amide group of 
Asnl00„ to mam-chain caibonyfs of 1001> H and 
100r,j produces a distortion of ihe p-sheet likely to 
be responsible for ;ht> non-standard conformation 



the others, forms a regular hairpin with the confor- 
mation expected fur a six-residue loop of sequence 
XXXX(N or C or D or X)X. the actual sequence 
being YSYYDiVl. 
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Short H3 Regions 

Regions of length VJ are too short to cluster with 
either the bulged or non-bwlgect torso regions of 
iungef H3 regions, However, lypicaliy the first 
three KMermifiai residue'; and the last five C-ferm- 
inat residues arc simiJat in conformation to the cor- 



H3 Region of Immwx obuttr 



responding residues of H3 regions with bulged 
torsos. Three of the structures with 10-residue H3 
regions (NQ1Q/12.5, lifted 50.1 (Igijtovn and 
CHA255 thodt contain the JrploS lydroges 
bond, and !•.,. .. similar main-sham conformations: 
the mis deviations of the backbone atoms in pair- 
wise fits of those regions are s'O 74 A (Table 10). 




Hj Pugior, of irnn^Linogiot 



Figure 13 shows the similarity in conformation of 
tl st - i! t ,) , .ii tU id ic id i- to f i > of 
CHA255 Hind) and she longer, bulged torso con- 
formation of DL3 (.Ivfa). The uniigated form of 
50 i i Sggb) ha iifi » nt structure As in the case 
of 17/9," the tigated state of 50 1 idggi, molecule 1) 
has a regular conformation and the untigated slate 
an irregular one. 



Effect of state of ligation 

In most of the antibody sSreetmes crystallized 
with and without antigen the H3 region has the 
.same conformation (below 0.8 A rros deviation for 
all backbone atoms; Table 3). In two cases, 50.1 
arid 17/9, the state of ligation has a large effect on 
the structure. In both instances, the conformation 
of the H3 tovso region is regular (bulged) in the 
presence of antigen and irregular in the unltgated 




state. This throws n«vv light on the vole of induced 
fit in antigen recognition, the antibody selected 
because of its capacity to interact with the antigen 
does undergo somatic: mutations theh, although not 
preventing the selected conformation to occur in 
presence of the antigen, could destabilize this con- 
formation in the uniiga I < iti >o \y in favour of 
the different one that we observe. ' ' 

In both crystal structures of 8V04-01: lebv 
(ligated) and lnbv ( mitigated) the IS-residue H3 
region has a regular bulged torso, but a different 
tip conformation. Shirm ei si. {I'i9t>} describe tee 
hydrogen bond pattern of the apex of the ligated 
form of BVG4-01* (lebv) as regular and that of the 
or vltd form finbv) is e re< 'hi thi: scon 
Sistent with our observation that the itgated confor- 
mations are more regular, representing a case in 
which the effect is observed in the head rather 
than in the torso region. 




Figure llfa-f} (o'vcrirf twj page 288) 
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Database Searching 

Standard structures evisrt that encompass »N resi- 
dues of H3 regions of up to 14 residues long and 
the torsos of longer regions. Three of the four nori- 
buleed H3 region-; are hairpixiH with the dihedral 
angles expected fur their sequence patterns. For the 
apices of bulged regions, the situation is rather 



different in that the pattern of hydrogen bonds in 
tht torso )? not Ihnt 

the bulge, and the same rates can therefore not 
always be applied. 

One way to address the question of the extent to 
which the conformation of a region is determined 
locally, rather than varying with the molecular 
context, is to search tht; database for other loops 
that tit onto the w t t ] w 1 rn residues, or "steiYio'" 
(JorWS & Tbirup, 1986; outeiitfe cl ill., 1987; 
Tramontane f,r he.sk, 1902; Ree-.tiei & Less, 1994) 
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Figure 12. Th< t- ri3 reg-.ce in 44-20 •Iflr) 

[residues C-t-4 to G--4). Tlse conformational singly* are: 



H3 Region at Sinrrnmog'obu.'i.'i 



Table 10. ans deviation of backbone atoms of fen-r. 
due H3 vegions 
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For ten-rcsidae H3 regions, we searched a data- 
i i b l mktn p oil u structure t< 2 0 \ 
resolution) for fragments of structures tho.i match 
the 5l<:m of the loop; Old io C0-3 and G-l to G-3 
(for ten-residue regions there are two residues at 
the apex between these two stems}. For Cha25S 
(Hiiid) the best fit to the stem occurs in penicitb- 
pepsin (3apf e f bone at >ms of 

eight residues comprising the six used for the 
search and the iwo central ones gave an rms devi- 
ation of 0,4 A, The highest ,u, i - ei „t the 
ends of the regmie cdirriiriaiintr. the ("-terminal resi- 
ton i ihi i tion to 0.2? A 

mely small value, Moreover, both hnd and 3»pp 
show valuer; of 4' angles at (he tip of the loop 
similar to those expected for XXXX turns {that is, 
characteristic of turns which contain no residues 
from the set Gly, Asn, Asp, Pro that tend to pro- 
duce unusual mam-chain conformations (Cholhta 
& Usk, 198?)}, 

Does a good fit of the stem imply a good fit of 
the whole region? 1 gurc H h « " i 
for CHA2S5 (lio.d) » u. esid < region, suggesting 
that' we can trust the loop we identify by data base 
searching only if the stem fit is very good. These 

t rfora n u >g< t that r len-i idee regioi 
if would be poss:bte to predict the conformation of 
the apex of the loop from the structure of the prox- 
imal residues and the sequence of the loop. 





Figure 1 1 r >e pmxunai resi- 

dues" of the ten- residue H3 regions of CHA255 (lind) 
t n'jju nis t i cJ tors* < n for- 

mation ot D13 CHba) (broken lines). 



Figure 14. The correlation between f t of stem as o;*d in 
database searching with the fit of the entire region 
ie.und, for CHA255 (lind). 



To investigate the head-; of the fonver bulged Ho 
gions we searched the database of known protein 
structures for loops et the appropriate seoeth with 
flanking residues that match in structure the corre- 
sponding regions of the torso of the H3 region. 

It 1 l ■ 'Li.!!.,, r tie , , e 

preceding and She lines following the apices of the 
bulged H3 regions (in order to "take into account 
the unusual H-bond onttem) of length 14 residues 
or longer. We imposed a threshold on backbone 
rms > deviation for these five "stem" residues of 
0.6 A. We subsequently selected among the loops 
found in our database search those for which the 
pattern in the sequence of residues that can take on 
unusual backbone conformations matches that of 
the jpex ot the target H3 re-gun said calculated 
the cms deviation of the backbone atoms N, C" 
and ' beisw en fht lc« ! i bh.at >se md die 
too: H3 reg.on. 

In order for such a procedure to provide a useful 
method tor predicting loop conformation, the data- 
base search, should extract loops with a reasonable 
match of the fi nkini region id U l m r r tr 
sequence pattern Failure to fine the seatcbed-for 
sequence pattern would result in an inability to 
predict the loop with the current database ' but 
would not produce incorrect predictions. In con- 
trast, a high mis deviation between the target loop 
and a loop with the expected sequence pastern 
would u suit t no or rest i 



Regions of length 14 

f-or D1W5 {'ifhl), J539 (2fbj) and N'EWM (7fab), 
regions with stems of similar conformation and (he 
same sequence pattern XXXG of the apex can be 
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H'3 Region of ^afjr-.c-giotviinc, 



found and, in all cases, the tip of the target loop 
and the database hits are very simitar (in some 

ily in peptide orientation; this is reflected in the 
column in the f'oi-owmg Table labe-fed O, report- 
ing the fit of N. C" and C atoms only): 

ons dwisrtion of tor.kti-eiK' sunns 
Target J« Model *«tn apex awxf^ stm-hspe* 
tjhl J«iy 0.42 2.02 0,55 0?> 

2fbi 2ega 0.33 1.01 0.60 J .03 



No ioof with 1 » j tu (i j path m 

of the apex a;; that of to 5 f2 lid) vas found in the 
database when searching for its. stems. 

In Sel55»1 (Imfb), a glycine at position G-S cre- 
ate;. ;j deviation front the norrnni indeed confor- 
mation, 'fin-- resuitunr. ooriiorniatior, ,s fairiv 
imcommon arid in (he database search we found 
very few hits, none of vvlneh had the same 
sequence pattern Using the current database, we 
would be unable to predict the conformation of the 
apex of the H3 loop of SelS5-4 (Imfb) by the 
methods described here. 

Regions longer than 14 residues 

For regions longer than 14 residues, the apex is 
longer than the hairpins for which correlations 
between sequence patterns and conformations 
have been established. We applied the database 
search procedure to regions of length 15 and 16. 

or thr ■ i n i to i i 1 ' n il o 
Soon m the database was identified for two not of 
the fcight eases: 

turn «tevlfl!k>n ol b*&Smm atom.'. 
Target % MoM stem aww npext-O) sUsn+aycA 

SU . - >V\ Zbhk 0.95 1.12 0.97 5.05 

In the other six cases, no structure was identified 
with the proper apex sequence pattern. 

It-i region of length < base asarching 
would pi it id i mod < ' 3 (i imp) but 
no hit could be found with the same sequence pat- 
tern (jf the other lb-residue regions. 




The conclusions are that the correct loop can 
often be found ; i i base of known 

protein stre to i ha in imilat st n 

confo nation t d the , o> oat etn at 

the apex. Although in some cases no similar stem 
could be found/ in no case did the search pro- 
cedure give an incorrect prediction. This suggests 



that the conformations of these regions are most 
likely determined predominantly by local inter- 
actions, in particular by the position of a glycine. 

Tests and extensions using newly determined 
immunoglobulin structures 

on a re;a::vely large number of structures, they 
leave open two torpoitant questions, particularly in 
view of the '/cry great variety of H3 conformations 
that we know must roast. To what extent wilt the 
roles presented here: (1) apply to structures deter- 
mined after this work was carried out? (2) provide 
a method of predicting H3 conformations in 
"blind" tests? 

The high-resolution antibody structures solved 
and deposited in the PDB suiec this analysis was 
carried out appear in Table 1 B. 

In all wso- , w vouid be pre- 

dicted, and in all eases but one a regular bulged 

t > i 1 m i t < i f t to it lu 
i\ to! minis i <• nto 

gene; usually the residue at this position is Ala 
(Cook & Tosrtlinson, 1995). At least one functional 
gt ~e (the source organism of 
hTto in t at p in i 1 ft ttltii ti I l l to! i 
V/ES (leap) i salt bri rmed b rween Lys93 

and AspIOl rather than Argfoi and Asp'itol, and 
the p-buige does not appear (Figure lb). The 
AsplOl-TrplOS hydrogen bond is present As 
noted above, this requires a generalization of the 
t i i i to i utge, 

For the apices of the bulged H3 regions, neither 
the rules of' sequence-structure correlation in hair- 
pins nor da-abase searching produced predictions 
of the conformation. 

Conclusions 

The H3 region of immunoglobulins is an import- 
ant source of antibody diversity, showing much 
greater variability in length,, sequence and confor- 
mation than the other five antigen-binding loops. 




Figure IS. The H3 region of ITER (leap; 



fl-3 Fieg<cr> ,:■! !,rim:,r ! oq!rit:.u:^S 



Analysis of H3 conformations shows thai the 
rtff.ictn proximal lo the framework has a restricted 
set of conformations, most commonly a bulged 
structure stabilised by a salt bridge between Arp/ 
Lys94 and AsplOJ, and a bvadroeen bond between 
the carbonv! O of the residue at position 100c: 
(G- 4) and the side-chain of Trpl03, In the absence 
of Arg or Lys at position 94 (or with a Lys at pos- 
ition 93 that can form an alternative suit bridge} 
the strand does not form a bulge, The confor- 
mation of the t i n, t at N-terj tin d md 
six C-termis • ■ ' rrv H3 region, can in 

almost all eases be predicted iron) the sequence of 
the region These results are in agreement with 
those of Shirai el at. {1996}. 

The conformation ot the aoex of the loop is 
determined bv many interactions: within the H3 
region itse.it, between H3 and other residues of the 
heavy chain, between H3 and the light chain, and 
between the antibody and antigen. We have classi- 
fied the structures that appear, and find that in 
many eases we can rationalize the conformation of 
the tip of tie; fid region in the known structures. 
Prediction of H3 conformation by rules derived 
here from the analysis of known structures is often 
successful. The rules for sequence- structure corre- 
lation in hairpins do not generally apply to the 
apices of hinged structures, but t 

f's>n ha i«i>fi!i In modellina toon* nrnvided that the 



stem; 



ships has reduced the uncertainty in the predict- 
able conformation to no more than a few residues 
at the tip of region. 
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